Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add filters

Database
Main subject
Language
Document Type
Year range
1.
biorxiv; 2022.
Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2022.01.21.475953

ABSTRACT

As the COVID-19 pandemic continues to affect the world, a new variant of concern, B.1.1.529 (Omicron), has been recently identified by the World Health Organization. At the time of writing, there are still no available primer sets specific to the Omicron variant, and its identification is only possible by using multiple targets, checking for specific failures, amplifying the suspect samples, and sequencing the results. This procedure is considerably time-consuming, in a situation where time might be of the essence. In this paper we use an Artificial Intelligence (AI) technique to identify a candidate primer set for the Omicron variant. The technique, based on Evolutionary Algorithms (EAs), has been already exploited in the recent past to develop primers for the B.1.1.7/Alpha variant, that have later been successfully tested in the lab. Starting from available virus samples, the technique explores the space of all possible subsequences of viral RNA, evaluating them as candidate primers. The criteria used to establish the suitability of a sequence as primer includes its frequency of appearance in samples labeled as Omicron, its absence from samples labeled as other variants, a specific range of melting temperature, and its CG content. The resulting primer set has been validated in silico and proves successful in preliminary laboratory tests. Thus, these results prove further that our technique could be established as a working template for a quick response to the appearance of new SARS-CoV-2 variants.


Subject(s)
COVID-19
2.
biorxiv; 2021.
Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2021.01.20.427043

ABSTRACT

As the COVID-19 pandemic persists, new SARS-CoV-2 variants with potentially dangerous features have been identified by the scientific community. Variant B.1.1.7 lineage clade GR from Global Initiative on Sharing All Influenza Data (GISAID) was first detected in the UK, and it appears to possess an increased transmissibility. At the same time, South African authorities reported variant B.1.351, that shares several mutations with B.1.1.7, and might also present high transmissibility. Even more recently, a variant labeled P.1 with 17 non-synonymous mutations was detected in Brazil. In such a situation, it is paramount to rapidly develop specific molecular tests to uniquely identify, contain, and study new variants. Using a completely automated pipeline built around deep learning techniques, we design primer sets specific to variant B.1.1.7, B.1.351, and P.1, respectively. Starting from sequences openly available in the GISAID repository, our pipeline was able to deliver the primer sets in just under 16 hours for each case study. In-silico tests show that the sequences in the primer sets present high accuracy and do not appear in samples from different viruses, nor in other coronaviruses or SARS-CoV-2 variants. The presented methodology can be exploited to swiftly obtain primer sets for each independent new variant, that can later be a part of a multiplexed approach for the initial diagnosis of COVID-19 patients. Furthermore, since our approach delivers primers able to differentiate between variants, it can be used as a second step of a diagnosis in cases already positive to COVID-19, to identify individuals carrying variants with potentially threatening features.


Subject(s)
COVID-19
3.
biorxiv; 2020.
Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2020.12.29.424715

ABSTRACT

The SARS-CoV-2 variant B.1.1.7 lineage, also known as clade GR from Global Initiative on Sharing All Influenza Data (GISAID), Nextstrain clade 20B, or Variant Under Investigation in December 2020 (VUI - 202012/01), appears to have an increased transmissability in comparison to other variants. Thus, to contain and study this variant of the SARS-CoV-2 virus, it is necessary to develop a specific molecular test to uniquely identify it. Using a completely automated pipeline involving deep learning techniques, we designed a primer set which is specific to SARS-CoV-2 variant B.1.1.7 with >99% accuracy, starting from 8,923 sequences from GISAID. The resulting primer set is in the region of the synonymous mutation C16176T in the ORF1ab gene, using the canonical sequence of the variant B.1.1.7 as a reference. Further in-silico testing shows that the primer set's sequences do not appear in different viruses, using 20,571 virus samples from the National Center for Biotechnology Information (NCBI), nor in other coronaviruses, using 487 samples from National Genomics Data Center (NGDC). In conclusion, the presented primer set can be exploited as part of a multiplexed approach in the initial diagnosis of Covid-19 patients, or used as a second step of diagnosis in cases already positive to Covid-19, to identify individuals carrying the B.1.1.7 variant.


Subject(s)
COVID-19
4.
biorxiv; 2020.
Preprint in English | bioRxiv | ID: ppzbmed-10.1101.2020.03.13.990242

ABSTRACT

In this paper, deep learning is coupled with explainable artificial intelligence techniques for the discovery of representative genomic sequences in SARS-CoV-2. A convolutional neural network classifier is first trained on 553 sequences from available repositories, separating the genome of different virus strains from the Coronavirus family with considerable accuracy. The networks behavior is then analyzed, to discover sequences used by the model to identify SARS-CoV-2, ultimately uncovering sequences exclusive to it. The discovered sequences are first validated on samples from other repositories, and proven able to separate SARS-CoV-2 from different virus strains with near-perfect accuracy. Next, one of the sequences is selected to generate a primer set, and tested against other state-of-the-art primer sets on existing datasets, obtaining competitive results. Finally, the primer is synthesized and tested on patient samples (n=6 previously tested positive), delivering a sensibility similar to routine diagnostic methods, and 100% specificity. In this paper, deep learning is coupled with explainable artificial intelligence techniques for the discovery of representative genomic sequences in SARS-CoV-2. A convolutional neural network classifier is first trained on 553 sequences from NGDC, separating the genome of different virus strains from the Coronavirus family with accuracy 98.73%. The networks behavior is then analyzed, to discover sequences used by the model to identify SARS-CoV-2, ultimately uncovering sequences exclusive to it. The discovered sequences are validated on samples from NCBI and GISAID, and proven able to separate SARS-CoV-2 from different virus strains with near-perfect accuracy. Next, one of the sequences is selected to generate a primer set, and tested against other state-of-the-art primer sets, obtaining competitive results. Finally, the primer is synthesized and tested on patient samples (n=6 previously tested positive), delivering a sensibility similar to routine diagnostic methods, and 100% specificity. The proposed methodology has a substantial added value over existing methods, as it is able to both identify promising primer sets for a virus from a limited amount of data, and deliver effective results in a minimal amount of time. Considering the possibility of future pandemics, these characteristics are invaluable to promptly create specific detection methods for diagnostics.

SELECTION OF CITATIONS
SEARCH DETAIL